An Approach to Automatic Indexing of Scientific Publications in High Energy Physics for Database SPIRES HEP

Authors

  • A. V. Averin
  • L. A. Vassilevskaya
Abstract

We introduce an approach to automatic indexing of e-prints based on a pattern-matching technique that makes extensive use of an Associative Patterns Dictionary (APD), which we developed. Entries in the APD consist of natural language phrases with the same semantic interpretation as a set of keywords from a controlled vocabulary. The method also recognizes, within e-prints, formulae written in TeX notation that might also appear as keywords. We present an automatic indexing system, AUTEX, which we have applied to keyword indexing of e-prints in selected areas of high energy physics (HEP), using the DESY-HEPI thesaurus as the controlled vocabulary.
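
The abstract does not give AUTEX's implementation, so the following is only a minimal sketch of the general idea, assuming the APD can be modelled as a table that maps phrase patterns to controlled-vocabulary keywords. The names `APD`, `TEX_PATTERNS`, and `index_text`, and every pattern and keyword shown, are hypothetical illustrations and are not taken from the real dictionary or from the DESY-HEPI thesaurus.

```python
import re

# Hypothetical APD entries: a natural-language phrase pattern mapped to the
# controlled-vocabulary keywords it is taken to imply (illustrative only).
APD = {
    r"neutrino\s+oscillations?": ["neutrino: oscillation"],
    r"magnetic\s+moment\s+of\s+the\s+neutrino": ["neutrino: magnetic moment"],
}

# Hypothetical patterns for formulae written in TeX notation (illustrative only).
TEX_PATTERNS = {
    r"\\nu\s*\\to\s*\\nu\s*\\gamma": ["neutrino: radiative decay"],
}


def index_text(text):
    """Return keywords whose associated patterns occur in the text."""
    keywords = []
    # Phrases are matched case-insensitively; TeX macros are case-sensitive.
    for table, flags in ((APD, re.IGNORECASE), (TEX_PATTERNS, 0)):
        for pattern, mapped in table.items():
            if re.search(pattern, text, flags):
                for keyword in mapped:
                    if keyword not in keywords:  # keep first-seen order, no duplicates
                        keywords.append(keyword)
    return keywords
```

Calling `index_text` on the text of an e-print returns the list of matched keywords in dictionary order. A real system would presumably need far richer patterns and some handling of context, which this sketch does not attempt.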

Similar resources

Physicists Thriving with Paperless Publishing

The Stanford Linear Accelerator Center (SLAC) and Deutsches Elektronen-Synchrotron (DESY) libraries have been comprehensively cataloguing the High Energy Particle Physics (HEP) literature online since 1974. The core database, SPIRES-HEP, now indexes over 400,000 research articles, with almost 50% linked to full-text electronic versions (this site now has over 15,000 hits per day). This database...

Information Resources in High-Energy Physics

Access to previous results is of paramount importance in the scientific process. Recent progress in information management focuses on building e-infrastructures for the optimization of the research workflow, through both policy-driven and user-pulled dynamics. For decades, High-Energy Physics (HEP) has pioneered innovative solutions in the field of information management and dissemination. In l...

INSPIRE: Realizing the Dream of a Global Digital Library in High-Energy Physics

High-Energy Physics (HEP) has a long tradition in pioneering infrastructures for scholarly communication, and four leading laboratories are now rolling out the next-generation digital library for the field: INSPIRE. This is an evolution of the extraordinarily successful, 40-year-old SPIRES database. Based on the Invenio software, INSPIRE already provides seamless access to almost 1 million rec...

INSPIRE: A new scientific information system for HEP

The status of high-energy physics (HEP) information systems has been jointly analyzed by the libraries of CERN, DESY, Fermilab and SLAC. As a result, the four laboratories have started the INSPIRE project – a new platform built by moving the successful SPIRES features and content, curated at DESY, Fermilab and SLAC, into the open-source CDS Invenio digital library software that was developed at...

Journal title:
  • CoRR

Volume cs.IR/0211041  Issue

Pages  -

Publication date 2002